PCluster: Probabilistic Agglomerative Clustering of Gene Expression Profiles

نویسنده

  • Nir Friedman
چکیده

A central problem in analysis of gene expression data is clustering of genes with similar expression profiles. In this paper, I describe an hierarchical clustering procedure that is based on simple probabilistic model. This procedure clusters genes with respect to a target classification of conditions, so that genes that are expressed similarly in each group of conditions are clustered together.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic agglomerative clustering of gene expression profiles

The increasing use of microarray technologies is generating a large amount of data that must be processed to extract underlying gene expression patterns. Existing clustering methods could suffer from certain drawbacks. Most methods cannot automatically separate scattered, singleton and mini-cluster genes from other genes. Inclusion of these types of genes into regular clustering processes can i...

متن کامل

Data Complexity in Clustering Analysis of Gene Microarray Expression Profiles

The increasing application of microarray technology is generating large amounts of high dimensional gene expression data. Genes participating in the same biological process tend to have similar expression patterns, and clustering is one of the most useful and efficient methods for identifying these patterns. Due to the complexity of microarray profiles, there are some limitations in directly ap...

متن کامل

TA-clustering: Cluster analysis of gene expression profiles through Temporal Abstractions

This paper describes a new technique for clustering short time series of gene expression data. The technique is a generalization of the template-based clustering and is based on a qualitative representation of profiles which are labelled using trend Temporal Abstractions (TAs); clusters are then dynamically identified on the basis of this qualitative representation. Clustering is performed in a...

متن کامل

Clustering Time-Series Gene Expression Data with Unequal Time Intervals

Abstract. Clustering gene expression data given in terms of time-series is a challenging problem that imposes its own particular constraints, namely exchanging two or more time points is not possible as it would deliver quite different results, and also it would lead to erroneous biological conclusions. We have focused on issues related to clustering gene expression temporal profiles, and devis...

متن کامل

PathCluster: a framework for gene set-based hierarchical clustering

MOTIVATION Gene clustering and gene set-based functional analysis are widely used for the analysis of expression profiles. The development of a comprehensive method jointly combining the two methods would allow for greater biological insights. RESULTS We developed a software package, PathCluster for gene set-based clustering via an agglomerative hierarchical clustering algorithm. The distance...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003